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Abstract 

In this paper we consider the solution of certain convex integer minimization problems via 
greedy augmentation procedures. We show that a greedy augmentation procedure that employs 
only directions from certain Graver bases needs only polynomially many augmentation steps 
to solve the given problem. We extend these results to convex A^-fold integer minimization 
problems and to convex 2-stage stochastic integer minimization problems. Finally, we present 
some applications of convex A^-fold integer minimization problems for which our approach 
provides polynomial time solution algorithms. 



1 Introduction 



For an integer matrix A G 'E'^^", we define the circuits C{A) and the Graver basis G{A) as follows. 
Herein, an integer vector v G Z" is called primitive if all its components are coprime, that is, 
gcd(Di, ...,v„) = l. 



Definition 1 Let A e Z'^''" and let Oj, j = 1,...,2" denote the 2" orthants of W\ Then the 
cones 

Cj := ker(A) n Oj = {z e Oj : Az = 0} 

are pointed rational polyhedral cones. Let Rj and Hj denote the (unique) minimal sets of primitive 
integer vectors generating Cj over R_|_ and Cj fl Z" over Z_|_ , respectively. Then we define 

2" 2" 

C{A) := y R, \ {0} and g{A) := |J H, \ {0} 
to be the set C{A) of circuits of A and the Graver basis GiA) of A. 



Remark 2 It is not hard to show that C{A) corresponds indeed to all primitive support- minimal 
vectors in ker(A) [7]. 



Already in 1975, Graver showed that C{A) and Q{A) provide optimality certificates for a large class 
of continuous and integer linear programs, namely for 

(LP)a,«,6,/ • niin{/(z) ■.Az = b,0<z<u,ze R!^}, 

and 

(IP)a,«.6,/ : min{/(z) : Az = b,0 < z < u, z e Z^}, 
where the linear objective function f{x) — c^x, the upper bounds vector u, and the right-hand 
side vector b are allowed to be changed [7]. A solution z° to (LP)^ u b f '^^ optimal if and only if 
there are no g € C{A) and a G M+ such that -I- ag is a feasible solution to (LP)^ „ f, ^ that 
has a smaller objective function value /(z" -I- ag) < f{z°)- Analogously, an integer solution z^ to 
(IP)^ M 6 / is optimal if and only if there are no 5 e G{A) and a £ Z+ such that z° + ag is a feasible 
solution to (IP)^ „ ft J that has a smaller objective function value f{z^ + ag) < f{z'-'). 

Thus, the directions from C{A) and G{A) allow a simple augmentation procedure that iteratively 
improves a given feasible solution to optimality. While this augmentation process has to terminate 
for bounded IPs, it may show some zig-zagging behaviour, even to non-optimal solutions for LPs 



Example 3 Consider the problem 

min{zi + Z2 - Z3 : 2zi + Z2 < 2, zi + 2z2 < 2, Z3 < 1, (zi, Z2, Z3) £ M>o} 

with optimal solution (0, 0, 1). Introducing slack variables Z4, Z5, zg we obtain the problem min{c''"z : 
Az = (2, 2, 1)T, z e R|o} with cT = (1, 1, -1, 0, 0, 0) and 

/ 2 1 1 \ 
A = 1 2 1 . 
\ 1 1 / 

The vectors (1, 0, 0, -2, -1, 0), (0, 1, 0, -1, -2, 0), (1, -2, 0, 0, 3, 0), (2, -1, 0, -3, 0, 0), (0, 0, 1, 0, 0, -1) 
together with their negatives are the circuits of A. The improving directions are given by all circuits 
V for which c^v > 0. 



Now start with the feasible solution zq = (0, 1, 0, 1, 0, 1). Following the directions (0, 1, 0, —1, —2, 0) 
and (0, 0, —1, 0, 0, 1) as far as possible, we immediately arrive at (0, 0, 1, 2, 2, 0) which corresponds 
to the desired optimal solution (0, 0, 1) of our problem. However, alternatively choosing only the 
vectors (—1, 2, 0, 0, —3, 0) and (2, —1, 0, —3, 0, 0) as improving directions, the augmentation process 
does not terminate. In our original space R^, this corresponds to the sequence of movements 

(0,1,0)^ (l,o,o)^(o,i,o)^Q,o,o)^(o,l,o)^... 

clearly shows the zig-zagging behaviour to the non-optimal point (0, 0, 0). □ 

Indeed, in order to avoid zig-zagging, certain conditions on the selection of the potential augmenting 
circuits must be imposed. As suggested in [S], one can avoid such an undcsircd convergence 

• by first choosing an augmenting circuit direction freely, and 

• by then moving only along such circuit directions that do not increase the objective value, 
that is c^g < 0, and which introduce an additional zero component in the current feasible 
solution, that is supp(z^ -I- ag) C supp(z°). After 0(n) such steps, we have again reached a 
vertex and may perform a free augmentation step if possible. 

A natural question that arises is, whether there are strategies to choose a direction from C{A) and 
G{A), respectively, to augment any given feasible solution of (LP)^ « 6 / « b / optimality 

in only polynomially many augmentation steps. In this paper, we answer this question affirmatively. 
For this let us introduce the notion of a greedy augmentation vector. 

Definition 4 Let T C M" he a set of feasible solutions, zq (z J- , f : M" M any objective function, 
and let S C M" be a (finite) set of directions. Then we call any optimal solution to 

min{/(2:o + cng) : a G 5 ^ S, zq + ag ^ T}, 

a greedy augmentation vector (from S for zq). 

Theorem 5 Let A e Z'*^", u G Q", b e Z'^ and c e Q" be given. Moreover, let f{z) = cJz. Then 
the following two statements hold. 

(a) Any feasible solution z^ to (LP)^ ^ ^ can be augmented to an optimal solution of {LP) ^ ^ ^ 
by iteratively applying the following greedy procedure: 

L Choose a greedy direction ag from C{A) and set z^ :~ z'^ + ag. 

Lf ag = 0, return zq as optimal solution. 

2. As long as it is possible, find a circuit direction g S C{A) and a > such that z^ -f ag 
is feasible, (jp ag) < c'^ z^ , and supp(z^ -I- ag) C supp(z^), and set z^ := z*^ -|- ag. 

Go back to Step 1. 
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The number of augmentation steps in this augmentation procedure is polynomially hounded 
in the encoding lengths of A, u, h, c, and . 

(b) Any feasible solution z'^ to {IP)/^ ^ ^ can be augmented to an optimal solution of {IP)j^ ^i, f 
by iteratively applying the following greedy procedure: 

Choose a greedy direction ag from Q{A) and set z'^ := z*^ + ag . 
If ag = 0, return zq as optimal solution. 

The number of augmentation steps in this augmentation procedure is polynomially bounded 
in the encoding lengths of A, u, b, c, and z'^ . 

For our proof of Theorem O we refer to Section [?7T] Note that in [3] it was shown that the Graver 
basis Q{A) aUows to design a polynomial time augmentation procedure. This procedure makes use 
of the oracle equivalence of so-called oriented augmentation and linear optimization established in 
[T5] . However, the choice of the Graver basis element that has to be used as a next augmenting 
vector using the machanism of |15| is far more technical than our simple greedy strategy suggested 
by Theorem [5l Part (b). 

In this paper, we generalize Part (b) of Theorem [5] to certain Z-convex objective functions. We say 
that a function 5 : Z ^ Z is Z-convex, if for all x,y & Z and for all < A < 1 with Xx+(l — \)y G Z, 
the inequality g{Xx -f (1 — A)y) > Xg{x) -I- (1 — A)g(y) holds. With this notion of Z-convexity, we 
generalize Part (b) of Theorem [S] to nonlinear convex objectives of the form f{c'^z, c[z, . . . , cj2:), 
where 

s 

f{yo,yi,---,ys) = ^fi{yi) + yo (1) 

2=1 

is a separable Z-convcx function and where co, . . . , Cs £ Z" are given fixed vectors. In particular, 
each function : Z ^ Z is Z-convex. When all fi = 0, we recover linear integer optimization as a 
special case. To state our result, let C denote the s x n matrix with rows ci, . . . , and let G{A, C) 
denote the Graver basis of ( c 7, ) projected onto the first n variables. As was shown in [ni[I3ji this 
finite set provides an improving direction for any non-optimal solution z^ of (IP)^ ub f- 

Theorem % Let A e Z'^>'", u S Z", 6 S Z'^, c € Q", Ci,...,Cs S Q". Moreover, let f{z) := 
f{c^z^ c\z, . . . , cjz), where f denotes a separable "L- convex function as in ^ given by a polynomial 
time comparison oracle which, when queried on x,y G , decides whether f{x) < f{y), f{x) = 
f{y), or f{x) > f{y) holds in time polynomial in the encoding lengths of x and y. Moreover, let 
H be an upper bound for the difference of maximum and minimum value of f over the feasible set 
{z : Az ~ b,0 < z < u, z £ Z"} and assume that the encoding length of H is of polynomial size in 
the encoding lengths of A,u,b,c,ci, . . . ,Cs. Then the following statement holds. 

Any feasible solution z° to {IP)ji „ / can be augmented to an optimal solution of {IP)j^ u b f 
iteratively applying the following greedy procedure: 

Choose a greedy direction ag from G{A, C) and set z'^ := z^ + ag. 
If ag = 0, return zq as optimal solution. 
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The number of augmentation steps in this augmentation procedure is polynomially bounded in the 
encoding lengths of A, u, b, c, Ci, . . . , Cs, and . 

For our proof of Thcorcm[6]wc refer to Section [5. 21 As a consequence to Theorem[6l we construct in 
Sections [5] and [3] polynomial time algorithms to solve convex iV-fold integer minimization problems 
and convex 2-stage stochastic integer minimization problems. In the first case, the Graver basis 
under consideration is of polynomial size in the input data and hence the greedy augmentation 
vector ag can be found in polynomial time. In the second case, the Graver basis is usually of 
exponential size in the input data. Despite this fact, the desired greedy augmentation vector ag 
can be constructed in polynomial time, if the fi are convex polynomial functions. Finally, we present 
some applications of convex A'^-fold integer minimization problems for which our approach provides 
a polynomial time solution algorithm. We conclude the paper with our proofs of Theorems [5] and 

m 



2 A^-fold convex integer minimization 

Let A G Z'^"'^", B e Z''*'^", and ci, . . . , £ Z" be fixed and consider the problem 

{N N ^ 

^/W -.^Bx^^ =6(°\y4xW =6W,0<a;W <u^\x'^^ € Z", i = 1, . . . , iV i , 

i=l i=l ) 

where we have 

for given convex functions /j'"* and vectors c^'^ G Z", i = 1, . . . , A^, j = 1, . . . , s. If we dropped the 

coupling constraint X^iLi Bx^^"^ ~ b^^\ this optimization problem would decompose into TV simpler 
convex problems 

min|/(') (xW) : Aa;« = b^'\0 < x^''> < u«,a;W G Z"} , i = 1, . . . , A^, 
which could be solved independently. Hence the name "iV-fold convex integer program" . 



Definition 7 The A^-fold matrix of the ordered pair A, B is the following {db + Nda) x Nn 
matrix, 



( B 


B 


B ■ 




A 





• 


• 





A 


• 


• 


V 





• 


• A 1 



For any vector x = {x^ , . . . , x^) with .t* G Z" for i ^ 1, . . . , N , we call the number \{i : x^ ^ Q}\ of 
nonzero building blocks x^ G Z" of x the type of x. 
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In [TT], it was shown that there exists a constant g{A,B) such that for all N the types of the 
Graver basis elements in Q{[A, B]^-^")) are bounded by g{A,B). In [4], this was exploited to solve 
linear TV-fold IP in polynomial time. 



Lemma 8 (Results from [4]) 



• For fixed matrices A and B the sizes of the Graver bases Q{[A, B]^'^'>) increase only polyno- 
mially in N . 

• For any choice of the right-hand side vector b, an initial feasible solution zq can be constructed 
in time polynomial in N and in the encoding length of b. 

• For any linear objective function c'^ z, this solution zq can be augmented to optimality in time 
polynomial in N and in the encoding lengths ofb, c, u, and zq. 



Using Theorem [51 we can now generalize this polynomial time algorithm to convex objectives of 
the form above. Let us prepare the main result of this section by showing that the encoding lengths 
of Graver bases from [21 [T3] increase only polynomially in N. For this, let C denotes the s x n 
matrix with rows ci , . . . , Cs . 



Lemma 9 Let the matrices A e Z'^"'^" , B e Z'^"'^" , and C e Z"''" be fixed. Then the encoding 
lengths of the Graver bases of 



( B B 

A 

A 



C 



c 



B 



A 



C 



\ 



Is J 



increase only polynomially in N . 



Proof. The claim follows from the results in [4] by rearranging the rows and columns as follows 

••• B \ 



[BOB 
A 

C Is 

A 
C 







A 
C Is j 
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This is the matrix of an iV-fold IP with A = {c i,) ^-^d with B = { b o). Hence, the sizes and the 
encoding lengths of the Graver bases increase only polynomially in N. □ 

Now that we have shown that the Graver basis is of polynomial size, we can consider each Graver 
basis element g independently and search for the best a £ Z+ such that zq + ag is feasible and has 
a smallest objective value. This can be done in polynomial time as the following lemma shows. 

Lemma 10 Let / : M ^ R &e a convex function given by a comparision oracle. Then for any given 
numbers l,u £ Z, the one- dimensional minimization problem min{/(a) : I < a < u} can be solved 
by polynomially many calls to the comparision oracle. 

Proof. If the interval [/, u] contains at most 2 integers, return Z or it as the minimum, depending 
on the values of f{l) and f{u). If the interval [I, u] contains at least 3 integers, consider the integers 
[{I + u)/2\ — 1, [{I + u)/2\ , [{I + u)/2\ + 1 £ [I, u] and exploit convexity of / to bisect the interval 
[I, u] as follows: 

If f{l{l + u)/2\ — 1) < f{[{l + u)/2\) holds, then the minimum of / must be attained in the interval 
[I, l{l + u)/2\]. If, on the other hand, /([(/ + u)/2\) > /([(^ + u)/2\ + 1), then the minimum of / 
must be attained in the interval [[(/ + u)/2\ + 1, w]. If none of the two holds, the minimum of / is 
attained in the point a = [{I + u)/2\ . 

Clearly, after 0(log(u — I)) bisection steps, the minimization problem is solved. □ 

The results in [4] together with the previous two lemmas now immediately imply the main result 
of this section. 

Theorem 11 Let A, B, C be fixed integer matrices of appropriate dimensions. Then the following 
holds. Moreover, let fj' : R ^ M 6e convex functions mapping "L to given by polynomial time 
evaluation oracles. Then the problem 

{N N \ 

(x(')) :^Sa;('' ^h''°\Ax^'^ = b^''\0 < x^''^ <u^'\x^''> G Z", i = 1, . . . , TV I , 
1=1 i=i ) 

can be solved in time polynomial in the encoding length of the input data. 

Proof. Polynomial time construction of an initial feasible solution from which we can start our 
augmentation process follows immediately from the results in [1]. 

To show that this feasible solution can be augmented to optimality in polynomial time, we note that 
by Theorem [H] that only polynomially many greedy augmentation steps are needed. By Lemma [HI 
we only need to check polynomially many directions g to search for a greedy augmentation vector. 
But this can be done in polynomial time by Lemma 1101 □ 
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3 Convex 2-stage stochastic integer minimization 



Multistage stochastic integer programming has become an important field of optimization, see 
[31 [121 E] for details. From a mathematical point of view, the data describing a 2-stagc stochastic 
integer program is as follows. Let T e Z''^™, W G Z''^", ci, . . . , G Z™, di, . . . , 4 G Z" be fixed, 
and consider the problem 

mui{E^{f'^{x,y)) : Tx + Wy = b'^ ,0 < x < u^,,0 < y < Uy,x eZ"',y E Z"}, 

where uj is some probability distribution in a suitable probability space and where / is a convex 
function of the form 

s 

r{x,y) + 
in which each f : M ^ M is a convex function. 

Discretizing the probability distribution using N scenarios, we obtain the following convex integer 
minimization problem 

minj^/W -.Tx + Wy^'^ = b^'\0 < x < u^,0 < y^'^ <uf ,x ElT ,y^'^ G Z" , i = 1, . . . , A^j , 

where we have 

s 

for given convex functions Note that fixing the first-stage decision x would decompose the 
optimization problem into iV simpler convex problems 

min{/(') (a;,y(')) : Ax^'^ = 6«,0 < < u^^\y^'^ G Z"} , 2 = 1, . . . , TV, 

which could be solved independently. However, the problem of finding a first-stage decision x with 
smallest overall costs would still remain to be solved. 



Lemma 12 (Results from jlOj ) 



• A vector (?;, Wi, . . . , Wat) is in the kernel of the matrix 



[T,W 



<N) 



( T W 

T Q W 



V T 



\ 





w J 



if and only if (w, Wi) G kcr ^[T, W] ^ for all i, that is, if Tv + Wwi = for all i. 



• The Graver bases for the matrices [T, W] decompose into a finite number of first-stage 
and second-stage building blocks that are independent on N . 
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• For any given linear objective, any given right-hand side vector and any non-optimal feasible 
solution zo, an improving vector to zq can be reconstructed from the building blocks in time 
linear in the number N of scenarios. 



Note that this finiteness result from [TU] does not imply that the Graver basis of [T, W] is of 
polynomial size in N. In fact, one can easily construct an exponential size counter-example. Before 
we present the main result of this section, let show that there exists a polynomial time optimality 
certificate also for convex 2-stage stochastic integer minimization problems of the type above, if 
the matrices T and W are kept fix. For this, let C denote the s x m matrix with rows ci, . . . , Cs, 
and let D denote the s x n matrix with rows c?i , . . . , . 

Lemma 13 The Graver bases of the matrices 

T W \ 

T W 

T W 
CD Is 
CD h 

C D Is j 

decompose into a finite number of first-stage and second-stage building blocks that are independent 
on N. 

For any given convex objective, any given right-hand side vector and any non- optimal feasible 
solution Zq, an improving vector to zq can be reconstructed from the building blocks in time linear 
in the number N of scenarios. 



[T,W,C,D]' ' 



Proof. To prove our first claim, we rearrange blocks within the matrix [T, W, C, D\ as follows: 

( T W Q 
C D Is 
T W 
C D 

T 

\ c 

which is the matrix of a 2-stage stochastic integer program with N scenarios and fixed matrices 
( ^ ) and ( ^ ) . Hence, its Graver basis consists out of a constant number of building blocks 
independent on N. This proves the first claim. 

To prove the second claim, note that the results from [SJ [T3] show that the Graver basis of 
[T, W, C, D] projected down onto the variables corresponding to T and W columns gives im- 









( w 




lie) 







■{N) 



W 
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proving directions for non-optimal solutions zq to 

min |E •^^'^ y'^'^) ■ + ^2/^'^ = b'^'Ko < X < W:,, < y < Uj,, X e Z", 2/ G Z", i = 1, . . . , 7v| . 

Thus, these directions consist out of only a constant number of building blocks independent on A^. 
Let z = {x, y*-^-* , . . . , y*-^-*) be a feasible solution and let g = {v, w*^^^ , ■ • ■ , w*-^-*) be an augmenting 
vector formed out of the constant number of first-stage and second-stage building blocks. To be an 
improving direction, g must satisfy the following constraints: 

• T{x + v) + W{y^'''> +w^^'>) = 6W, i = l,...,N, 

• < X + V < Ux, 

• < y(') + < u'y \ i = l,...,N, 

For each of the finitely many first-stage building blocks perform the following test: If < x+v < Ux, 
try to find suitable second-stage building blocks satisfying the remaining constraints, which for fixed 
V simplify to 

• Tv + Ww^''> ^Q,i = l,...,N, 

• < y(') < u^y''\ i = 1, . . . ,iV, 

• Ef=i + y« + ^^■'') < e;Ii f^'^ y^'^)- 

For fixed v, this problem decomposes into N independent minimization problems: 

min |/(') (^x + V, y(*) + w^*)) : Tv + Ww^"^ = 0, < y^'^ + ly^'^ < } = 1, • ■ - ^• 

If for those optimal values EiLi + "^i + w*-'^) < Ei3=i (^^i 2^''*'') holds, we have found 

an improving vector g = {v^w''^\ . . . ,10'-^^) for zq. If one of these minimization problems is in- 
feasible or if Ei^i Z^'"* + y^'"* + w'*-') > Ei^i Z^'"* (^1 y''*')! then no augmenting vector for zq 
can be constructed using the first-stage building block v. If for no first-stage building block v, an 
augmenting vector can be constructed zq must be optimal. If there was an augmenting vector for zq 
with some first-stage building block v, this vector or even a better augmenting vector would have 
been constructed by the procedure above when the first-stage building block v was considered. □ 

Note that the augmenting vector constructed in the proof of the previous lemma need not be a 
Graver basis element (it may not be minimal) , but every Graver basis element could be constructed, 
guaranteeing the optimality certificate. It remains to show how to construct a greedy augmentation 
vector from the building blocks from the Graver basis. Note that the procedure in the previous proof 
constructs an augmenting vector also for a fixed step length a. To compute a greedy augmentation 
vector, however, one has to allow a to vary. But then, the minimization problem does not decompose 
into N independent simpler problems. It is this difficulty that enforces us to restrict the set of 
possible convex functions. 
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Definition 14 We call a convex function f : M™+" R that maps 2™+" to Z splittable, if 

for all fixed vectors x G Z™, y, 91,92 G Z", and for all finite intervals [l,u\ C M, there exists 
polynomially many (in the encoding length of the problem data) intervals /i, . . . ,/r such that 

r 

• = U Ir, 

i=l 

• /i n n Z = for all 1 < i < j < r, and 

• for each j = 1, . . . , r, either f{x, y + agi) < f{x, y + 032) or f{x, y + agi) > f{x, y + a92) 
holds for all a ^ Ij. 

Note that convex polynomials of fixed maximal degree k are splittable, as f{x, y+agi)— /(x, y+ag2) 
switches its sign at most k times. Hence each interval [I, u] can be split into at most k + 1 intervals 
with the desired property. With the notion of splittable convex functions, we can now state and 
prove the main theorem of this section. 

Theorem 15 Let T , W , C , D he fixed integer matrices of appropriate dimensions. Then the 
following holds. 

(a) For any choice of the right-hand side vector h, an initial feasible solution zq to 

( ^ 

mini ■ Tx + Wy*-'^ = b'-'\o < x<u^,0< y''> < ul;\x £ Z"',y'-'^ £ 1" ,i = 1, . . . , N 

can he constructed in time polynomial in N and in the encoding length of the input data. 

(b) Then, for any choice of splittable convex functions , this solution zq can be augmented to 
optimality in time polynomial in the encoding length of the input data. 

Proof. Let us prove Part (b) first. This proof follows the main idea behind the proof of Lemma [T51 
Let z = (x, y*-^-*, . . . , y*-^-*) be a feasible solution and let g = (w, w'^^\ . . . , w'^^^) be an augmenting 
vector formed out of the constant number of first-stage and second-stage building blocks. Again, 
for fixed v, we wish to consider each scenarios independently. For this, note that the possible step 
length a G Z_|_ is bounded from above by some polynomial size bound Ua , since our feasible region 
is a polytope. Since the convex functions /^'' are splittable, we can for each scenario partition the 
interval [0,Wq] into polynomially subintcrvals /i.i, . . . , /^.r^ such that for each interval lij there is 
either no building block leading to a feasible solution or a well-defined building block Wij with 
Tv + Wwij = and < y -f- awij < w'*' that minimizes /'*^ {x + v, y''^ -f- awi^j ) for all a G . 

Taking the common refinement of all intervals lij, i = 1, . . . , A^, j = 1, . . . , r^, one obtains poly- 
nomially many intervals Ji, . . . , Jt, such that for each interval Ji and for all a G Ji, there is a 
well-defined building block for each scenario minimizing the function value. For this fixed vec- 
tor 9 = (w, . . . , w*^^)) we then compute the best a G Ji, and then compare these values 
'^i^i (x -|- af , y^'' -t-au"^'') to find the desired greedy augmentation vector. Applying Theo- 
rem [51 this proves Part (b). 
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Finally, let us prove Part (a). For this, introduce nonnegative integer slack- variables into the 
second-stages to obtain a linear IP with problem matrix 



■(N) 



/ T W Id -Id 
TOO W Id -I 



Id 








[T,{WJd,-Id)] 



\T 00 



W Id -Id / 



whose associated Graver basis is formed out of only constantly many first- and second-stage building 
blocks. Using this extended formulation, we may immediately write down a feasible solution. Using 

(AT) 

only greedy directions from the Graver basis of [T, {W, Id, —Id)] , we can minimize the sum of 
all slack- variables in polynomially many augmentation steps. Part (b) now implies that an optimal 
solution to this extended problem can be found in polynomial time. If all slack-variables are 0, we 
have found a feasible solution to our intial problem, otherwise the initial problem is infeasible. □ 

Let us conclude with the remark that these polynomiality results for convex 2-stage stochastic 
integer minimization can be extended to the multi-stage situation by applying the finiteness results 
from [2]. 



4 Some Applications 

Consider the following general nonlinear problems over an arbitrary set J- C Z" of feasible solutions: 

Separable convex minimization: Find a feasible point x G J- minimizing a separable convex 
cost function f{x) := fii^i) with each fi a univariate convex function. It generalizes 

standard linear optimization with cost /(x) = X]"=i '^J-^* recovered with fi{xi) := cjxi for 
some costs c^. 

Minimum /p-distance: Find a feasible point x G minimizing the Zp-distancc to a partially 
specified "goal" point x S Z". More precisely, given 1 < p < oo and the restriction xi := 
{xi : i G I) of X to a, subset / C {1, . . . ,n} of the coordinates, find x £ J- minimizing the 
Zp-distancc — x/Hp := foi' 1 < P < oo and Hx/— a;/||oo := niaxjg/ jxi — iij 



Note that a common special case of the above is the natural problem of /p-norm minimization over 
J-', min{||a;||p : x S J-"}; in particular, the Ztx^-norm minimization problem is the min-max problem 
minjmax"^-!^ \xi\ : x G J-}. 

In our discussion of iV-fold systems below it will be convenient to index the variable vector as 
X — {x^ , . . . , x^) with each block indexed as = {xi^i, . . . , Xi_„), i = 1, . . . , N. 

We have the following corollary of Theorem llll which will be used in the applications to follow. 



for p ~ oo. 
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Corollary 16 Let A and B be fixed integer matrices of compatible sizes. Then there is an algorithm 
that, given any positive integer N, right-hand sides V , and upper bound vectors u\ of suitable 
dimensions, solves the above problems over the following set of integer points in an N-fold program 

N 

T = {a;= (a;\...,a;^) gZ^^" : ^Sa;*=6", Ax' ^ b\ Q < < u\ i^l,...,N} (2) 

1=1 

in time which is polynomial in N and in the binary encoding length of the rest of the input, as 
follows: 

1. For i = I, . . . , N and j = 1, . . . , n, let fi_j denote convex univariate functions. Moreover, let 
f{x) := X]fc=i Sj=i fi,j(^i,j) given by a comparision oracle. Then the algorithm solves the 
separable convex minimization problem 

{N n 

2. Given any I C {1,...,7V} x {l,...,7i}, any partially specified integer point xj := [xij : 
{i,j) G and any integer l<p<ooorp = oo, the algorithm solves the minimum 
Ip-distance problem 

min {||.T/ — x/||p : x £ T} . 
In particular, the algorithm solves the Ip-norm minimization problem min{l|a;||p : x G J-}. 

Proof. Consider first the separable convex minimization problem. Then this is just the special 
case of Theorem [TT] with Cj := Ij the standard j-th unit vector in Z" for j = 1, . . . , n and := 
in Z" for i = 1, . . . , N. The objective function in Theorem [TT] then becomes the desired objective, 

N N n N n 

i—l i—1 j — 1 i—1 j — 1 

Next consider the minimum Zp-distance problem. Consider first an integer 1 < p < oo. Then we 
can minimize the integer- valued p-th power ||a;||^ instead of the /p-norm itself. Define 

J'^^^^''^' — \ 0, otherwsie. 
With these fij, the objective in the separable convex minimization becomes the desired objective, 



N 



Next, consider the case p — oo. Let w :~ max{|iii.j| : i ~ 1, . . . , N, j ^ 1, . . . ,n} he the maximum 
upper bound on any variable. We may assume w > else T C {0} and the integer program is 
trivial. Choose a positive integer q satisfying q\og{l + {2w)^^) > \og{Nn). Now solve the minimum 
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iq-distance problem and let x* G T he an optimal solution. We claim that x* also minimizes the 
Zoo-distance to x. Consider any x £ J-. By standard inequalities between the loo and Iq norms, 

\\x*i-Xl\\oc < \\x*j-Xl\\q < \\XI-Xi\\q < (Nn)^ \\xi - Xl\\oo ■ 

Therefore 

\\x*i - xiWoo ~ \\xi - xiWoo < {(Nn)-^ -l)\\xi -xiWoo < iiNn)^-l)2w < 1 

where the last inequality holds by the choice of q. Since \\x*j — a;/||oo and — S/Hoo are integers 
we find that indeed — xi\\ao < ||a^/ — S/Hoo holds for sll x G J- and the claim follows. □ 



4.1 Congestion-avoiding (multi-way) transportation and routing 

The classical (discrete) transportation problem is the following. We wish to transport commodities 
(in containers or bins) on a traffic network (by land, sea or air), or route information (in packets) 
on a communication network, from n suppliers to N customers. The demand by customer i is di 
units and the supply from supplier j is Sj units. We need to determine the number Xi_j of units to 
transport to customer i from supplier j on channel i <— j subject to supply-demand requirements 
and upper bounds Xi^ < Uij on channel capacity so as to minimize total delay or cost. The classical 
approach assumes a channel cost Ci^j per unit flow, resulting in linear total cost X]i3=i Sj=i 
But due to channel congestion when subject to heavy traffic or heavy communication load, the 
transportation delay or cost on a channel are actually a nonlinear convex function of the flow 
over it, such as fijixij) = Cij\xij\°''--^ for suitable aij > 1, resulting in nonlinear total cost 
J2i j fi,j{xi.j), which is much harder to minimize. 

It is often natural that the number of suppliers is small and flxed while the number of customers is 
very large. Then the transportation problem is an A'^-fold integer programming problem. To see this, 
index the variable vector ) with x^ = {xiSi ■ ■ ■ 1 Xi,n) and likewise for the upper 

bound vector. Let := di ior i ^ 1, . . . , N and let 6° := (si, . . . , s„). Finally, let A = (1, ... , 1) 
be the 1 x n matrix with all entries equal to 1 and let B be the n x n identity matrix. Then the 
A^-fold constraints Ax^ = V , i = 1, . . . , N and B{J2^=i x^) = &° represent, respectively the demand 
and supply constraints. The feasible set in ([2]) then consists of the feasible transportations and the 
solution of the congestion- avoiding transportation problem is provided by CoroUarv 1161 part 1. So 
we have: 

Corollary 17 Fix the number of suppliers and let fij, i = 1, . . . , N , j = 1, . . . ,n, denote convex 
univariate functions. Moreover, let f{x) := X]il=i Sj=i fi,ji^i,j) given by a comparision oracle. 
Then the congestion- avoiding transportation problem can be solved in polynomial time. 

This result can be extended to multi-way (high-dimensional) transportation problems as well. In 
the 3-way line-sum transportation problem, the set of feasible solutions consists of all nonncgative 
integer L x M x N arrays with specified line-sums and upper bound (capacity) constraints, 

^i.k ; ^ ^ X^ jj^ — t^ j , Xij_Ji ^ '^ijj.k } • 
k 

(3) 



i 3 
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If at least two of the array-size parameters L, M, N are variable then even the classical linear 
optimization problem over is NP-hard [5]. In fact, remarkably, every integer program is a 3 x 
M X N transportation program for some M and N [3]. But when both L and M are relatively 
small and fixed, the resulting problem over "long" arrays, with a large and variable number N of 
layers, is again an A'^-fold program. To see this, index the variable array as a; = (a;-'^, . . . , x^) with 
a;* = {xi,i^i, . . . ,XL,M,i) and likewise for the upper bound vector. Let A be the (L + M) x LM 
incidence matrix of the complete bipartite graph Kl,m and let B be the LAI x LM identity 
matrix. Finally, suitably define the right-hand side vectors 6'', h ^ 0, . . . , N in terms of the given 
line sums rj^k, Si^k, and ti^j. Then the n-fold constraint -B(X]/^=i ^'^) ~ ^° represents the line-sum 
constraints where summation over layers occurs, whereas Ax^ = , h = 1, . . . , N, represent the 
line-sum constraints where summations are within a single layer at a time. Then we can minimize 
in polynomial time any separable convex cost function Xii^i Sjli SfeLi fi,j,k{xi,j,k) over the set 
of feasible transportations in ([3]). So we have: 

Corollary 18 Fix any L and M and let fij_k, i ~ ^, ■ ■ ■ i L, j = 1, . . . , M , k ~ 1, . . . , N , denote 
convex univariate functions. Moreover, let f(x) := Yld=i Sj=i SfcLi fi.j,k{xi.j^k) be given by a 
comparision oracle. Then the congestion-avoiding S-way transportation problem can be solved in 
polynomial time. 

Even more generally, this result holds for "long" d-way transportations of any fixed dimension d 
and for any hierarchical sum constraints, see Section [4.31 below for the precise definitions. 

4.2 Error-correcting codes 

Linear-algebraic error correcting codes generalize the "check-sum" idea as follows: a message to be 
communicated on a noisy channel is arranged in a vector x. To allow for error correction, several 
sums of subsets of entries of x are communicated as well. Multi-way tables provide an appealing way 
of organizing the check-sum protocol. The sender arranges the message in a multi-way Mi x ■ • ■ x Md 
array x and sends it along with the sums of some of its lower dimensional sub-arrays (margins). 
The receiver obtains an array x with some entries distorted on the way; it then finds an array 
X having the specified check-sums (margins), that is /p-closest to the received distorted array i, 
and declares it as the retrieved message. For instance, when working over the {0, 1} alphabet, the 
useful Hamming distance is precisely the Zi-distance. Note that the check-sums might be distorted 
as well; to overcome this difficulty, we determine ahead of time an upper bound U on all possible 
check-sums, and make it a fixed part of the communication protocol; then we blow each array to 
size (Ml -I- 1) X • • • x (Md + 1), and fill in the new "slack" entries so as to sum up with the original 
entries to U. 

To illustrate, consider 3- way arrays of format Lx M x N (already augmented with slack variables). 
Working over alphabet {0, . . . define upper bounds Uij,k ■= u for original message variables 
and Uij^k := U for slack variable. Then the set of possible messages that the receiver has to choose 
from is 

^ := e Z^^^^^^ : ^ x,,j,k = ^^J-^fe = H ^^J.fc = U,Q< x^,,,k < u,,j,k } ■ (4) 

i j k 
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Choosing L and M to be relatively small and fixed, T is again the set of integer points in an A''- 
fold system. Corollary [16] part 2 now enables the efficient solution of the error- correcting decoding 
problem 

min{||i — i||p : x Cz J-} . 
Corollary 19 Fix L, M . Then 3-way Ip error- correcting decoding can be done in polynomial time. 

4.3 Hierarchically-constrained multi-way arrays 

The transportation and routing problem, as well as the error-correction problem, have very broad 
and useful generalizations, to arrays of any dimension and to any hierarchical sum constraints. We 
proceed to define such systems of arrays. 

Consider c?-way arrays x = {xi-^^,,,^i^) of size Mi x • • • x Md- For any d-tuple (ii, . . . ,id) with ij S 
{1, . . . , Mj} U {+}, the corresponding margin Xi^^,,,_i^ is the sum of entries of x over all coordinates 
j with ij = +. The support of (zi, . . . , id) and of Xi^^...^i^ is the set supp(zi, . . . ,id) := {j : ij ^ +} 
of non-summed coordinates. For instance, ifa:isa4x5x3x2 array then it has 12 margins 
with support H = {1, 3} such as x^^+^2,+ = Y^i.2=i X]i4=i 2^3,12,2,44 • Given a family Ti of subsets of 
{1, . . . , c?} and margin values for all tuples with support in 7i, consider the set of integer 

nonncgative and suitably upper-bounded arrays with these margins, 

■= {x eZ^'^'^'"^^^'' : Xii,...,,;^ = , supp(ii, . . . e H, < < . 

The congestion-avoiding transportation problem over J-'n is to find x G J-n minimizing a given 
separable convex cost fii,...,ia{xi-^^...^i^). The error-correcting decoding problem over J-'n is 

to estimate an original message as i S J-n minimizing a suitable /p-distance Hi — to a received 
message x. 

Again, for long arrays, that is, of format Mi x ■ • • x Md-i x with d and Mi, . . . , Md-i fixed and 
only the length (number of layers) variable, the set J-n is the set of feasible points in an iV-fold 
systems and, as a consequence of Corollarv ll61 we can solve both problems in polynomial time. 

Corollary 20 Fix any d, Mi, . . . , Md-i and family Ti. of subsets of {1, . . . , d}. Then congestion- 
avoiding transportation and error- correcting decoding over Tt-l can be solved in polynomial time for 
any array length Md '■= N and any margin values Vi-^^,,,_i^ for all tuples (ii, . . . ,id) with support in 

n. 



5 Proofs of Theorems [5] and [6] 

In this section we finally prove Theorems [5] and [6l For this, we employ the following fact. 

Lemma 21 (Theorem 3.1 in Ahuja et al. [l]) Let H be the difference between maximum and 
minimum objective function values of an (integer valued) optimization problem. 
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Suppose that is the objective function value of some solution of a minimization problem at 
the k-th interation of an algorithm and f* is the minimum objective function value. Furthermore, 
suppose that the algorithm guarantees that for every iteration k, 

if" - > f3if' - /*) 

(i.e., the improvement at iteration k+l is at least (3 times the total possible improvement) for some 
constant < /3 < 1 (which is independent of the problem data). Then the algorithm terminates in 
0((logiJ)//3). 

5.1 Proof of Theorem [5] 

Let A denote the least common multiple of all non- vanishing maximal subdeterminants of A. Note 
that the encoding length log A is polynomially bounded in the encoding lengths of the input data 
A, M, h and c. Hence, the objective function values of two vertices are either the same or differ by 
at least 1/A. 

Let /° = /S. ■ c^ denote the normalized objective value of the initially given feasible solution 
and by . . . denote the normalized objective values of the vertices z^, z^, . . . that we reach at 

the end of the second steps of the augmentation procedure. Note that the difference H between 
maximum and minimum normalized objective function values of (LP)^ ^ ^ has an encoding length 
log-ff that is polynomially bounded in the encoding lengths of the input data A, u, b and c. We 
now show that 

(/'-/'+')>/3(/'-r) 

holds for 0</3=l/n<l and conclude by Lcmma[2ll that wc only have to enumerate 0((log H)n) , 
that is polynomially many, vertices. 

Cosider the vector z* — z*^ £ ker(A). There is some orthant Oj such that z* - z*^ e ker(A) n Oj. 
Hence, we can write 

n 

z* - = X! 

1=1 

for some at G R+ and g,; G C{A) CiOj, i = 1, . . . ,n. As Uigi has the same sign pattern as z'^ — z*, 
one can easily check that the components of z^ + aigi lie between the components of z^ and of z*. 
Hence they are nonnegative. As gi £ ker(y4), we have Agi = and thus A{z^ + = Az^ = b 
for any choice of i ~ 1, . . . ,n. Consequently, z*^ + aigi is a feasible solution for any choice of 
i = 1, . . . , n. Finally, we have 

n n 

^■c^{z^ -z*) = Y,^- = 5] A • cT(z'= - (z^ + a,g{)) 

i=l 1=1 

from which wc conclude that there is some index iq such that 

A.cT(z'=-(z'= + a.„5.o)) = A.cV-a,„g,J > i V A ■ c^-a.^.) = - A ■ c-^ [z^ - z*) = -{f'^ - f*)- 

n. '—^ n. n 
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Note that a greedy choice for an augmentating vector cannot make a smaller augmentation step 
than the vector aioffio- Thus, 

n 

This proves Part (a). 

The proof of Part (b) is is nearly literally the same. Clearly, in the integer situation, we may choose 
A = 1. If z-'^, . . . denote the vectors that we reach from our initial feasible solution via greedy 
augmentation steps, we only have to be careful about the choice of f3. In the integer situation, we 
need to choose /3 = l/(2n — 2), since for the integer vector z* — S ker(A) n at most 2n — 2 
vectors from the Hilbert basis of Cj = ker(A) n Oj are needed to represent each lattice point in 
Cj n Z" as a nonnegative integer linear combination of elements in Q{A) n Oj [16] . Thus, we need 
to apply 0((log H){2n — 2)) = 0((log H)n) augmentation steps, a number being polynomial in the 
encoding length. □ 



5.2 Proof of Theorem [6] 

In Oils], it was shown that G{A, C) allows a representation 

2()i+,s)-2 

{z*-z\-C{z*-z'^))= a,i9^,~Cg,), 

where each on G Z+ and where each —Cgi) lies in the same orthant as [z* — z'"', —C{z* — z^)). 
It follows again from the results in |16j that at most 2 (n + s) — 2 summands are needed. Similarly to 
the proof of Theorem[5l we can already conclude from this representation that z^ + aigi is feasible 
for alH = 1, . . . ,2(n + .s) - 2. 

Moreover, in [13| it was shown that for such a representation superadditivity holds, that is, 

2(?i+,s)-2 

/V)-/V)> E [/v+"..9.)-/V)] 

i=l 

and thus, rewritten, 

2(n+s)-2 

Therefore, there is some index io such that 

Z[n + sj — 2 Z[n + s) — 2 

and the result follows from Lemma [^T] □ 
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